Analyzing and revising data integration schemas to improve their matchability
نویسندگان
چکیده
Data integration systems often provide a uniform query interface, called a mediated schema, to a multitude of data sources. To answer user queries, such systems employ a set of semantic matches between the mediated schema and the data-source schemas. Finding such matches is well known to be difficult. Hence much work has focused on developing semi-automatic techniques to efficiently find the matches. In this paper we consider the complementary problem of improving the mediated schema, to make finding such matches easier. Specifically, a mediated schema S will typically be matched with many source schemas. Thus, can the developer of S analyze and revise S in a way that preserves S’s semantics, and yet makes it easier to match with in the future? In this paper we provide an affirmative answer to the above question, and outline a promising solution direction, called mSeer. Given a mediated schema S and a matching toolM , mSeer first computes a matchability score that quantifies how well S can be matched against using M . Next, mSeer uses this score to generate a matchability report that identifies the problems in matching S. Finally, mSeer addresses these problems by automatically suggesting changes to S (e.g., renaming an attribute, reformatting data values, etc.) that it believes will preserve the semantics of S and yet make it more amenable to matching. We present extensive experiments over several real-world domains that demonstrate the promise of the proposed approach.
منابع مشابه
Analyzing and Revising Mediated Schemas to Improve Their Matchability
Data integration systems often provide a uniform interface, called a mediated schema, to a multitude of disparate data sources. To answer user queries posed over the mediated schema, such systems employ a set of semantic matches between this schema and the local schemas of the data sources. Finding such matches is well known to be difficult. Hence much work has focused on developing semi-automa...
متن کاملPredicting Inefficient Problem-Solving Methods based on Early Maladaptive Schemas in Drug-Dependent Individuals
Objective: The aim of this study was to predict the inefficient problem-solving methods of drug-dependent individuals based on early maladaptive schemas. Method: This study was a descriptive-correlational research. The statistical population of the study included all drug-dependent individuals referred to addiction treatment camps in Qom in 2019. The statistical sample of the study was 270 drug...
متن کاملThe effectiveness of MCT, on maladjusted Schemas among divorced women
Abstract Divorce and its effects on the family system in recent years increasingly been the focus of psychological research, in line to the researches, this research was done to evaluate effectiveness of metacognitive therapy on maladaptive schemas among divorced women. The method of research was quasi- experimental design, with pretest-posttest and control group. For sampling 30 women who ...
متن کاملLearning Styles and the Writing Process in a Digitally Blended Environment: Revising, Switching, and Pausing Behaviors in Focus
The present investigation sought to explore the relationship between learning styles and writing behaviors of EFL learners in a blended environment. It also aimed to identify the learning style types best predicting writing behaviors. Initially, the participants' preferred learning styles were identified through the Kolb’s learning style inventory (Kolb, 1984). Secondly, data were obtained thro...
متن کاملThe Effectiveness of Emotionally Focoused Couple therapy on their Emotional Schemas of Young Couples
Introduction: The aim of this study was to the effectiveness of emotion- focoused couple therapy on the emotional schemas of young couples. Methods: The research method was semi-experimental to pretest-posttest. The statistical population consisted of all young couples with adjustment problems who referred to psychological service and counseling centers in Tehran in the second half of 2019 (N=...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 1 شماره
صفحات -
تاریخ انتشار 2008